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ABSTRACT 



Visualizing Internet web traffic is disclosed. In one 
embodiment, a number of windows are displayed, corre- 
sponding to a number of clusters into which users have been 
partitioned based on similar web browsing behavior. The 
windows are ordered from the cluster having the greatest 
number of users to the cluster having the least number of 
users. Each window has one or more rows, where each row 
corresponds to a user within the cluster. Each row has an 
ordered number of visible units, such as blocks, where each 
block corresponds to a web page visited by the user. The 
blocks can be color coded by the type of web page they 
represent. In one embodiment, the corresponding cluster 
models for the clusters are alternatively displayed in the 
windows. 

26 Claims, 6 Drawing Sheets 



404a 404b 404c 



404d 



402a- 
402b- 
402c- 




404e 



^410 



□□□EBB 




400. 



11/15/04, EAST Version: 2.0.1.4 



FIG.1 



SYST^MEMORY 
(ROM) 24 



BIOS 



^26 



J2 



OPERATING 
SYSTEM 



APPLICATION 
PROGRAMS 



PROGRAM 
MODULES 



PROGRAM 
DATA 



pa 

25 



20 





J* 




PROCESSING 




VIDEO 


UNIT 




ADAPTER 



SYSTEM BUS 



23 



HARD DISK 

DRIVE 
INTERFACE 



MAGNETIC 
DISK 
DRIVE 

INTERFACE 



OPTICAL 
DRIVE 
INTERFACE 



^000 ^ 



A6 



SERIAL 
PORT 
INTERFACE 





/ 




1 

\ E 














OPERATING 
SYSTEM 


APPUCATION 
PROGRAMS 


OTHER 
PROGRAM 
MODULES 


PROGRAM 
DATA 





NETWORK 
INTERFACE 



MONITOR 



LOCAL AREA 
NETWORK 



"X" 
51 



MODEM 



^IWIDE AREA NETWORK 



42 ^ * 



Ik 



REMOTE 
COMPUTER 



APPUCATION . 
PROGRAMS 



. .- ti n. $ 



11/15/04, EAST version: 2.0.1.4 



U.S. Patent Aug. 3, 2004 Sheet 2 of 6 



US 6,771,289 Bl 



FIG 2 




11/15/04, EAST Version: 2.0.1.4 



U.S. Patent Aug. 3, 2004 Sheet 3 of 6 US 6,771,289 Bl 



FIG 3 



304a 



304b 



304c 



.304d 





302a 


□□□□□ 


302b 




302c 


□□□ 

« 


302d 

• 


• 
■ 


* 
■ 

302n 



30( 



11/15/04, EAST Version: 2.0.1.4 



U.S. Patent Aug. 3, 2004 Sheet 4 of 6 US 6,771,289 Bl 




11/15/04, EAST Version: 2.0.1.4 



U.S. Patent Aug. 3, 2004 Sheet 5 of 6 US 6,771,289 Bl 



FIG 5 



PARTITION 
USERS INTO 
CLUSTERS 



504 



DISPLAY 
WINDOWS 



500, 



11/15/04, EAST Version: 2.0.1.4 



U.S. Patent Aug. 3, 2004 Sheet 6 of 6 



US 6,771,289 Bl 



FIG 6 



DISPLAY 
DEVICE 



COMPUTER 
PROGRAM 




11/15/04, EAST version: 2.0.1.4 



US 6,7 

1 

CLUSTER-BASED VISUALIZATION OF 
USER TRAFFIC ON AN INTERNET SITE 

FIELD OF THE INVENTION 

This invention relates generally to visualizing traffic on 
Internet world-wide-web sites, and more particularly to such 
visualization that is cluster-based in nature. 

BACKGROUND OF THE INVENTION 

Browsing the Internet has become an exceedingly popular 
application for computers and computer-like devices. In 
particular, sites on the wo rid -wide -web (WWW, or web) 
have seen increasing visitation. A web site includes a num- 
ber of web pages, identified by Uniform Resource Locator 
(URL) addresses. For example, a home page of a web site 
may be addressed as http://www.mypage.com/index.html. 

As web sites have become more popular, tracking their 
usage has becoming important, especially for commercial 
sites. Operators of web sites, for example, are interested in 
determining how users proceed through the site in a single 
session. A session is when a user views one or more pages 
of the web site, in a particular order, before going to a 
different web site. By being able to determine how users are 
proceeding through their sites, operators are able to better 
understand their users, and how to make their web sites 
better. 

For this and other reasons, therefore, there is a need for 
the present invention. 

SUMMARY OF THE INVENTION 

The invention relates to visualizing web traffic in a 
cluster-based manner. In one embodiment, a number of 
windows are displayed, corresponding to a number of clus- 
ters into which users have been partitioned based on similar 
web browsing behavior. The windows are ordered from the 
cluster having the greatest number of users to the cluster 
having the least number of users. Each window has one or 
more rows, where each row corresponds to a user within the 
cluster. Each row has an ordered number of visible units, 
such as blocks, where each block corresponds to a web page 
visited by the user. The blocks can be color coded by the 
type of web page they represent. In one embodiment, the 
corresponding cluster models for the clusters are alterna- 
tively displayed in the windows. The windows desirably can 
be made larger or smaller, as well as be scrolled through via 
scroll bars. 

Thus, embodiments of the invention allow web site opera- 
tors to view first-hand how users are visiting their web sites. 
The visual approach of embodiments of the invention pro- 
vides for advantages as compared to non-visual approaches, 
since usage data for web sites can be increasingly large. 
Without a visual approach, for instance, it is more difficult 
for operators to make conclusions as to usage of their web 
sites, as compared to using a visual approach. 

The invention includes computer-implemented methods, 
machine-readable media, computerized systems, and com- 
puters of varying scopes. Other aspects, embodiments and 
advantages of the invention, beyond those described here, 
will become apparent by reading the detailed description and 
with reference to the drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a diagram of an operating environment in 
conjunction with which embodiments of the invention can 
be practiced; 
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FIG. 2 is a diagram of a display device on which a number 
of windows can be shown in accordance with an embodi- 
ment of the invention; 

FIG. 3 is a diagram of a representative such window in 
5 accordance with an embodiment of the invention; 

FIG. 4 is a diagram of a window showing a corresponding 
model for a cluster, according to an embodiment of the 
invention; 

FIG. 5 is a flowchart of a method according to an 
10 embodiment of the invention; and, 

FIG. 6 is a diagram of a system according to an embodi- 
ment of the invention. 

DETAILED DESCRIPTION OF THE 
15 INVENTION 

In the following detailed description of exemplary 
embodiments of the invention, reference is made to the 
accompanying drawings which form a part hereof, and in 

20 which is shown by way of illustration specific exemplary 
embodiments in which the invention may be practiced. 
These embodiments are described in sufficient detail to 
enable those skilled in the art to practice the invention, and 
it is to be understood that other embodiments may be utilized 

25 and that logical, mechanical, electrical and other changes 
may be made without departing from the spirit or scope of 
the present invention. The following detailed description is, 
therefore, not to be taken in a limiting sense, and the scope 
of the present invention is defined only by the appended 

30 claims. 

Some portions of the detailed descriptions which follow 
are presented in terms of algorithms and symbolic repre- 
sentations of operations on data bits within a computer 
memory. These algorithmic descriptions and representations 

35 are the means used by those skilled in the data processing 
arts to most effectively convey the substance of their work 
to others skilled in the art. An algorithm is here, and 
generally, conceived to be a self-consistent sequence of steps 
leading to a desired result. The steps are those requiring 

40 physical manipulations of physical quantities. Usually, 
though not necessarily, these quantities take the form of 
electrical or magnetic signals capable of being stored, 
transferred, combined, compared, and otherwise manipu- 
lated. 

45 It has proven convenient at times, principally for reasons 
of common usage, to refer to these signals as bits, values, 
elements, symbols, characters, terms, numbers, or the like. It 
should be borne in mind, however, that all of these and 
similar terms are to be associated with the appropriate 

50 physical quantities and are merely convenient labels applied 
to these quantities. Unless specifically stated otherwise as 
apparent from the following discussions, it is appreciated 
that throughout the present invention, discussions utilizing 
terms such as processing or computing or calculating or 

55 determining or displaying or the like, refer to the action and 
processes of a computer system, or similar electronic com- 
puting device, that manipulates and transforms data repre- 
sented as physical (electronic) quantities within the com- 
puter system's registers and memories into other data 

60 similarly represented as physical quantities within the com- 
puter system memories or registers or other such informa- 
tion storage, transmission or display devices. 
Operating Environment 

Referring to FIG. 1, a diagram of the hardware and 

65 operating environment in conjunction with which embodi- 
ments of the invention may be practiced is shown. The 
description of FIG. 1 is intended to provide a brief, general 
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description of suitable computer hardware and a suitable personal computer through input devices such as a keyboard 

computing environment in conjunction with which the 40 and pointing device 42. Other input devices (not shown) 

invention may be implemented. Although not required, the may include a microphone, joystick, game pad, satellite 

invention is described in the general context of computer- dish, scanner, or the like. These and other input devices are 

executable instructions, such as program modules, being 5 often connected to the processing unit 21 through a serial 

executed by a computer, such as a personal computer. port interface 46 that is coupled to the system bus, but may 

Generally, program modules include routines, programs, be connected by other interfaces, such as a parallel port, 

objects, components, data structures, etc., that perform par- game port, or a universal serial bus (USB). A monitor 47 or 

ticular tasks or implement particular abstract data types. other type of display device is also connected to the system 

Moreover, those skilled in the art will appreciate that the 10 bus 23 via an interface, such as a video adapter 48, In 

invention may be practiced with other computer system addition to the monitor, computers typically include other 

configurations, including hand-held devices, multiprocessor peripheral output devices (not shown), such as speakers and 

systems, microprocessor-based or programmable consumer printers. 

electronics, network PC's, minicomputers, mainframe The computer may operate in a networked environment 

computers, and the like. The invention may also be practiced is using logical connections to one or more remote computers, 

in distributed computing environments where tasks are such as remote computer 49. These logical connections are 

performed by remote processing devices that are linked achieved by a communication device coupled to or a part of 

through a communications network. In a distributed com- the computer; the invention is not limited to a particular type 

puting environment, program modules may be located in of communications device. The remote computer 49 may be 

both local and remote memory storage devices. 20 another computer, a server, a router, a network PC, a client, 

The exemplary hardware and operating environment of a peer device or other common network node, and typically 

FIG. 1 for implementing the invention includes a general includes many or all of the elements described above 

purpose computing device in the form of a computer, relative to the computer, although only a memory storage 

including a processing unit 21, a system memory 22, and a device has been illustrated in FIG. 1. The logical connec- 

system bus 23 that operatively couples various system 25 tions depicted in FIG. 1 include a local-area network (LAN) 

components include the system memory to the processing 51 and a wide-area network (WAN) 52. Such networking 

unit 21. There may be only one or there may be more than environments are commonplace in office networks, 

one processing unit 21, such that the processor of computer enterprise- wide computer networks, intranets and the 

comprises a single central-processing unit (CPU), or a Internal, which are all types of networks, 
plurality of processing units, commonly referred to as a 30 When used in a LAN-networking environment, the com- 

parallel processing environment. The computer may be a puter is connected to the local network 51 through a network 

conventional computer, a distributed computer, or any other interface or adapter 53, which is one type of communica- 

type of computer; the invention is not so limited. tions device. When used in a WAN-networking 

The system bus 23 may be any of several types of bus environment, the computer typically includes a modem 54, 

structures including a memory bus or memory controller, a 35 a type of communications device, or any other type of 

peripheral bus, and a local bus using any of a variety of bus communications device for establishing communications 

architectures. The system memory may also be referred to as over the wide area network 52, such as the Internet. The 

simply the memory, and includes read only memory (ROM) modem 54, which may be internal or external, is connected 

24 and random access memory (RAM). A basic input/output to the system bus 23 via the serial port interface 46. In a 

system (BIOS) 26, containing the basic routines that help to 40 networked environment, program modules depicted relative 

transfer information between elements within the computer, to the personal computer, or portions thereof, may be stored 

such as during start-up, is stored in ROM 24. The computer in the remote memory storage device. It is appreciated that 

further includes a hard disk drive 27 for reading from and the network connections shown are exemplary and other 

writing to a hard disk, not shown, a magnetic disk drive 28 means of and communications devices for establishing a 

for reading from or writing to a removable magnetic disk 29, 45 communications link between the computers may be used, 

and an optical disk drive 30 for reading from or writing to Cluster-based Web Traffic Visualization Techniques 
a removable optical disk 31 such as a CD ROM or other In this section of the detailed description, various 

optical media. approaches and techniques for visualizing web site activity 

The hard disk drive 27, magnetic disk drive 28, and in a cluster-based manner are described. In the diagram of 

optical disk drive 30 are connected to the system bus 23 by 50 FIG. 2, for instance, on the display device 200 a series of 

a hard disk drive interface 32, a magnetic disk drive inter- windows 202a, 2026, 202c, . . . , 202a", 202m and 202/j is 

face 33, and an optical disk drive interface 34, respectively. shown. Each of the series of windows corresponds to a 

The drives and their associated computer-readable media cluster into which users of a given web site or web sites have 

provide nonvolatile storage of computer-readable been partitioned based on similar web browsing behavior 

instructions, data structures, program modules and other 55 through the site(s). A cluster is ' generally defined in a 

data for the computer. It should be appreciated by those nonrestrictive and a non-limiting manner herein as a group 

skilled in the art that any type of computer-readable media of like items that have been associated with one another 

which can store data that is accessible by a computer, such based on one or more common or similar characteristics via 

as magnetic cassettes, flash memory cards, digital video a predetermined technique. For example, in one embodiment 

disks, Bernoulli cartridges, random access memories 60 of the invention, an Expectation Maximization (EM) 

(RAMs), read only memories (ROMs), and the like, may be approach is used, as known within the art, to yield a mixture 

used in the exemplary operating environment. model of first-order Markov models, as also known within 

A number of program modules may be stored on the hard the art, where each cluster has a corresponding first -order 

disk, magnetic disk 29, optical disk 31, ROM 24, or RAM, Markov model. In another embodiment, an Expectation 

including an operating system 35, one or more application 65 Maximization (EM) approach is used, as known within the 

programs 36, other program modules 37, and program data art, to yield a mixture model of zero -order Markov models, 

38. A user may enter commands and information into the as also known within the art, where each cluster has a 
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corresponding zero-order Markov model. However, the value by their shading intensity. In the diagram of FIG. 4, 

invention is not so limited. The windows are desirably however, the shading intensity is indicated as a number from 

ordered from the cluster having the greatest number of users 0 to 5 within a block, where 0 indicates dark, and 5 indicates 

therein to the cluster having the least number of users light. Thus, the block 406 has an intensity level 5, the block 

therein. Each of the windows can desirably be increased in 5 408 has an intensity level 0, the block 410 has an intensity 

size, also known as maximized, so that more rows of the leve j 0 , and the block 412 has an intensity level 0, etc. This 

window is shown on the device 200. Furthermore, desirably nomenclature is for illustrative clarity only, however, and the 

each of the windows can have scroll bars added thereto, as invention fc not ^ limitecL Furthermore, the number of 

known within the art, so that the data within the window is intensity levels is also not limited by the invention, 

navigable via the bars 1Q £ach rQW has in the column mb a visible unit that has 

In the diagram of FIG. 3, a representative window 300, an ^ indicating the frequency with which users 

that could be any of the series of windows, is shown. The . . . 4 ' c . - 4 4 c u *u 

window has at least one row, such as the rows 302a, 3026, visit mat P ar < of * he Slte ' T ° r l ™° of ™* P a ^f ™ Sl *> 

302c, 302rf, - - . , 302/z. Each row corresponds to a particular represented by the row In one embodiment, this is the 

session of a particular user, where the sessions of the users probability of visiting that type of web page on the site 

in a given cluster are grouped together based on similar web ™ conditioned on inclusion in the cluster represented by the 

browsing behavior. Each row has an ordered number of window, or, p(part|cluster). For example, the block 406 of 

visible units, such as, in one embodiment, blocks, although the row 402a has the maximum intensity level of 5, indi- 

the invention is not so limited. As an example, the row 302a eating that this probability is high. Each row also has in the 

has ordered blocks 304a, 3046, 304c and 304o\ Each of the column 404c a visible unit that has an intensity level 

ordered visible units of a row represents a page of the web 20 indicating the frequency with which a user's first visit is to 

site visited by the user of the row. For example, the row 302a the part of the site represented by the row. For example, the 

indicates that in this session, a user has visited four pages of block 408 of the row 402a has the minimum intensity level 

a site. of 0, indicating that users within this cluster are not likely to 

Desirably, the visible units are coded, such as by color, to visit this part of the site — that is, this type of web page of the 

indicate the different types of pages on the site. The types of 25 site — first, 

web pages are referred to herein synonymously as the Furthermore, each row has in the column 404a 1 a number 

different parts of the site that can be visited by the user. Thus, of visible units that have intensity levels indicating the 

for example, for a news-related web site, the visible units transition probabilities of the first-order Markov model, such 

may be color coded as to the particular parts of the site, such as p(part i at time t+1, p part j at time t), etc., as can be 

as local news encompassing local news-related pages, global 30 appreciated by those of ordinary skill within the art. 

news encompassing global news-related pages, financial Alternatively, the conditional transition probabilities, p(part 

news encompassing financial news-related pages, sports i at time t+1 |p part j at time t) can be indicated by the visible 

news encompassing sports news-related pages, entertain- units within the column 404a*. Finally, each row has in the 

ment news encompassing entertainment news-related pages, column 404e a visible unit that has an intensity level 

etc. Thus, the path of a given user session within a given 35 indicating the probability at which the user leaves the site 

cluster through a web site can easily be determined via from that part of the site, or type of web page on the site, 

examination of the number and order of the visible units, as represented by the row. For example, the block 412 of the 

well as their color. Furthermore, in one embodiment, the row 402a has the minimum intensity level of 0, indicating 

rows within a window can be ordered based on a predeter- that users within the cluster represented by the window 400 

mined typicality measure of the users to which the rows 40 are unlikely to leave the web site from the part of the site 

correspond. In one embodiment, the predetermined measure represented by the row 402a. 

is the probability of behavior based on location within a Methods and Systems 

cluster, or, p(behavior|cluster). In this section of the detailed description, methods and 
As has been described, in one embodiment, each cluster systems according to varying embodiments of the invention 
to which a window corresponds has a corresponding first- 45 are described. In some embodiments, the methods in par- 
order Markov model. Therefore, in one embodiment, a ticular are computer-implemented. The computer- 
selected window or windows, or all the windows, can be implemented methods can be realized at least in part as one 
made to display the corresponding Markov model(s) for or more programs running on a computer — that is, as a 
these window(s), in lieu of the ordered rows corresponding program executed from a computer-readable medium such 
to user sessions of a web site. This is shown in FIG. 4. In the 50 as a memory by a processor of a computer, such as the 
window 400, there is a series of rows, such as the rows 402a, computer shown in and described in conjunction with FIG, 
4026 and 402c. Each row corresponds to a different type of 1. The programs are desirably storable on a machine- 
web page on the site(s) that can be visited by the user. This readable medium such as a floppy disk or a CD-ROM, for 
is also synonymously referred to herein as a different part of distribution and installation and execution on another com- 
the site(s) that can be visited by the user. It is noted that only 55 puter. 

three rows are shown in the window 400 for sake of Referring to FIG. 5, a flowchart of a method 500 accord- 
illustrative clarity. The rows are divided into a number of ing to an embodiment of the invention is shown. In 502, 
columns, such as the columns 404a, 4046, 404c, 404d and users are partitioned into clusters, based on similar web 
404e. Each row has a label corresponding to a type of web browsing behavior thereof, as has been described in the 
page in the first column 404a. Thus, the row 402a has the 60 previous section of the detailed description. In 504, windows 
label news, encompassing news-related web pages of the are displayed on a display device that correspond to the 
site(s), the row 4026 has the label ent, for entertainment, clusters, in a manner consistent with as has also been 
encompassing entertainment-related web pages of the described in the previous section of the detailed description. 
site(s), the row 402c has the label money, encompassing Referring next to FIG. 6, a diagram of a system 600 
money — or finance -related web pages of the site(s), etc. 65 according to an embodiment of the invention is shown. The 
Each row has in the columns 4046, 404c, 404a* and 404e system 600 can in one embodiment be implemented within 
one or more visible units, such as blocks, that indicate a the operating environment described previously in the 



11/15/04, EAST Version: 2.0.1.4 



US 6,771,289 Bl 



8 



IS 



detailed description in conjunction with FIG. 1. The system 
600 includes a display device 602 and a computer program 
604. The device 602 can be a computer monitor, such as a 
flat-panel display (FPD) or a cathode-ray tube (CRT) 
display, as known within the art, in one embodiment. The 5 
program 604 is designed to display windows on the device 
602 corresponding to clusters into which users have been 
partitioned based on similar browsing behavior of the users, 
as has been described in the previous section of the detailed 
description. In one embodiment, the program 604 can be 
considered the means for accomplishing this functionality. 
Furthermore, the program 604 can be executed from a 
computer-readable medium, such as a hard disk drive or a 
memory, by a processor in one embodiment. 
Conclusion 

Although specific embodiments have been illustrated and 
described herein, it will be appreciated by those of ordinary 
skill in the art that any arrangement which is calculated to 
achieve the same purpose may be substituted for the specific 
embodiments shown. This application is intended to cover 2Q 
any adaptations or variations of the present invention. 
Therefore, it is manifestly intended that this invention be 
limited only by the claims and equivalents thereof 

We claim: 

1. A computer-implemented method comprising: 25 
partitioning a plurality of users into a plurality of clusters 

based on similar web browsing behavior of the users; 
and 

displaying a plurality of windows on a display device 
corresponding to the plurality of clusters; 30 

wherein each window has at least one row, each row 
corresponding to a user within the cluster to which the 
window corresponds and having an ordered number of 
visible units, each visible unit corresponding to and 
visually indicating a type of web page visited by the 35 
user, and the plurality of windows are hierarchically 
ordered as a function of corresponding clusters and 
users per respective cluster. 

2. The method of claim 1, wherein the plurality of 
windows are ordered from a window corresponding to a 40 
cluster having a greatest number of users to a window 
corresponding to a cluster having a least number of users. 

3. The method of claim 1, wherein partitioning the plu- 
rality of users into the plurality of clusters comprises utiliz- 
ing a mixture model of first-order Markov models based on 45 
an Expectation Maximization (EM) approach. 

4. The method of claim 1, wherein partitioning the plu- 
rality of users into the plurality of clusters comprises utiliz- 
ing a mixture model of zero -order Markov models based on 

an Expectation Maximization (EM) approach. 50 

5. The method of claim 3, further comprising displaying 
a corresponding first-order Markov model for a cluster 
within the window to which the cluster corresponds. 

6. The method of claim 3, further comprising displaying 

a corresponding zero -order Markov model for a cluster 55 
within the window to which the cluster corresponds. 

7. The method of claim 1, wherein each window can be 
enlarged such that a greater number of the at least one row 
of the window is visible. 

8. The method of claim 1, wherein the at least one row in go 
each window are ordered based on a predetermined typical- 
ity measure of the users to which the at least one row 
correspond. 

9. A computer-implemented method comprising: 
partitioning a plurality of users into a plurality of clusters 65 

based on similar web browsing behavior of the users; 
and 



displaying a plurality of windows on a display device 
corresponding to the plurality of clusters; 

wherein each window has at least one row, each row 
corresponding to a user within the cluster to which the 
window corresponds and having an ordered number of 
visible units, each visible unit corresponding to a type 
of web page visited by the user, and each visible unit 
comprises a colored block, such that the colored block 
has a color corresponding to the type of web page to 
which the colored block corresponds. 

10. A machine-readable medium having instructions 
stored thereon for execution by a processor to perform a 
method comprising: 

displaying a plurality of windows on a display device 
corresponding to a plurality of clusters into which a 
plurality of users have been partitioned based on simi- 
lar web browsing behavior thereof, 

wherein each window has at least one row, each row 
corresponding to a user within the cluster to which the 
window corresponds and having an ordered number of 
visible units, each visible unit corresponding to and 
visually indicating a type of web page visited by the 
user, and the plurality of windows are hierarchically 
ordered based on number of users per cluster. 

11. The medium of claim 10, wherein the plurality of 
windows are ordered from a window corresponding to a 
cluster having a greatest number of users to a window 
corresponding to a cluster having a least number of users. 

12. The medium of claim 10, the method initially com- 
prising partitioning the plurality of users into the plurality of 
clusters based on the similar web browsing behavior of the 
users. 

13. The medium of claim 12, wherein partitioning the 
plurality of users into the plurality of clusters comprises 
utilizing a mixture model of first-order Markov models 
based on an Expectation Maximization (M) approach. 

14. The medium of claim 13, further comprising display- 
ing a corresponding first-order Markov model for a cluster 
within the window to which the cluster corresponds. 

15. The medium of claim 10, wherein each window can 
be enlarged such that a greater number of the at least one row 
of the window is visible. 

16. The medium of claim 10, wherein the at least one row 
in each window are ordered based on a predetermined 
typicality measure of the users to which the at least one row 
correspond. 

17. A machine-readable medium having instructions 
stored thereon for execution by a processor to perform a 
method comprising: 

displaying a plurality of windows on a display device 
corresponding to a plurality of clusters into which a 
plurality of users have been partitioned based on simi- 
lar web browsing behavior thereof, 

wherein each window has at least one row, each row 
corresponding to a user within the cluster to which the 
window corresponds and having an ordered number of 
visible units, each visible unit corresponding to and 
visually indicating a type of web page visited by the 
user, and each visible unit comprises a colored block, 
such that the colored block has a color corresponding to 
the type of web page to which the colored block 
corresponds. 

18. A computerized system comprising: 
a display device; 

a computer program designed to display a plurality of 
windows on the display device corresponding to a 
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plurality of clusters into which a plurality of users have 
been partitioned based on similar web browsing behav- 
ior of the users; 
wherein each window has at least one row, each row 
corresponding to a user within the cluster to which the 
window corresponds and having an ordered number of 
visible units, each visible unit corresponding to and 
visually indicating a type of web page visited by the 
user, and the plurality of windows are hierarchically 
ordered as a function of corresponding clusters and 
users per respective cluster. 

19. The system of claim 18, wherein the plurality of 
windows are ordered from a window corresponding to a 
cluster having a greatest number of users to a window 
corresponding to a cluster having a least number of users. 

20. The system of claim 18, wherein each window can be 
enlarged such that a greater number of the at least one row 
of the window is visible. 

21. The system of claim 18, wherein the at least one row 
in each window are ordered based on a predetermined 
typicality measure of the users to which the at least one row 
correspond. 

22. A computerized system comprising: 
a display device; 

a computer program designed to display a plurality of 
windows on the display device corresponding to a 
plurality of clusters into which a plurality of users have 
been partitioned based on similar web browsing behav- 
ior of the users; 

wherein each window has at least one row, each row 
corresponding to a user within the cluster to which the 
window corresponds and having an ordered number of 
visible units, each visible unit corresponding to and 
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visually indicating a type of web page visited by the 
user, and each visible unit comprises a colored block, 
such that the colored block has a color corresponding to 
the type of web page to which the colored block 
corresponds. 

23. A computerized system comprising: 
a display device; 

means for displaying a plurality of windows on the 
display device corresponding to a plurality of clusters 
into which a plurality of users have been partitioned 
based on similar web browsing behavior of the users, 

wherein each window has at least one row, each row 
corresponding to a user within the cluster to which the 
window corresponds and having an ordered number of 
visible units, each visible unit corresponding to a type 
of web page visited by the user, and the plurality of 
windows are ordered from a window corresponding to 
a cluster having a greatest number of users to a window 
corresponding to a cluster having a least number of 
users. 

24. The system of claim 23, wherein each visible unit 
comprises a colored block, such that the colored block has 
a color corresponding to the type of web page to which the 
colored block corresponds. 

25. The system of claim 23, wherein each window can be 
enlarged such that a greater number of the at least one row 
of the window is visible. 

26. The system of claim 23, wherein the at least one row 
in each window are ordered based on a predetermined 
typicality measure of the users to which the at least one row 
correspond. 
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